Distribution-free, Risk-controlling Prediction Sets
نویسندگان
چکیده
While improving prediction accuracy has been the focus of machine learning in recent years, this alone does not suffice for reliable decision-making. Deploying systems consequential settings also requires calibrating and communicating uncertainty predictions. To convey instance-wise tasks, we show how to generate set-valued predictions from a black-box predictor that controls expected loss on future test points at user-specified level. Our approach provides explicit finite-sample guarantees any dataset by using holdout set calibrate size sets. This framework enables simple, distribution-free, rigorous error control many demonstrate it five large-scale problems: (1) classification problems where some mistakes are more costly than others; (2) multi-label classification, each observation multiple associated labels; (3) labels have hierarchical structure; (4) image segmentation, wish predict pixels containing an object interest; (5) protein structure prediction. Last, discuss extensions quantification ranking, metric learning, distributionally robust learning.
منابع مشابه
Distribution Free Prediction Sets.
This paper introduces a new approach to prediction by bringing together two different nonparametric ideas: distribution free inference and nonparametric smoothing. Specifically, we consider the problem of constructing nonparametric tolerance/prediction sets. We start from the general conformal prediction approach and we use a kernel density estimator as a measure of agreement between a sample p...
متن کاملDistribution Free Prediction Bands
We study distribution free, nonparametric prediction bands with a special focus on their finite sample behavior. First we investigate and develop different notions of finite sample coverage guarantees. Then we give a new prediction band estimator by combining the idea of “conformal prediction” (Vovk et al., 2009) with nonparametric conditional density estimation. The proposed estimator, called ...
متن کاملOnline Learning for Distribution-Free Prediction
We develop an online learning method for prediction, which is important in problems with large and/or streaming data sets. We formulate the learning approach using a covariance-fitting methodology, and show that the resulting predictor has desirable computational and distribution-free properties: It is implemented online with a runtime that scales linearly in the number of samples; has a consta...
متن کاملDistribution-free Prediction Intervals in Mixed Linear Models
This paper considers prediction intervals for a future observation in the context of mixed linear models. For such prediction problems, it is reasonable to assume that the future observation is independent of the current ones. Our approach is distribution-free, that is, we do not assume that the distributions of the random effects and errors are normal or specified up to a finite number of para...
متن کاملSum-Free Sets and Related Sets
A set A of integers is sum-free if A\(A+A) = ;. Cameron conjectured that the number of sum-free sets A f1; : : : ; Ng is O(2 N=2). As a step towards this conjecture, we prove that the number of sets A f1
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of the ACM
سال: 2021
ISSN: ['0004-5411', '1557-735X']
DOI: https://doi.org/10.1145/3478535